Comparing n-gram-based functional categories in original versus translated texts
نویسندگان
چکیده
منابع مشابه
a corpus-based study of the frequency of personal pronouns in translated and comparable non-translated persian texts
چکیده ندارد.
15 صفحه اولLanguage Models for Machine Translation: Original vs. Translated Texts
We investigate the differences between language models compiled from original target-language texts and those compiled from texts translated to the target language. Corroborating established observations of Translation Studies, we demonstrate that the latter are significantly better predictors of translated sentences than the former, and hence fit the reference set better. Furthermore, translat...
متن کاملGraph-Based N-gram Language Identification on Short Texts
Language identification (LI) is an important task in natural language processing. Several machine learning approaches have been proposed for addressing this problem, but most of them assume relatively long and well written texts. We propose a graph-based N-gram approach for LI called LIGA which targets relatively short and ill-written texts. The results of our experimental study show that LIGA ...
متن کاملRelevance Ranking for Translated Texts
The usefulness of a translated text for gisting purposes strongly depends on the overall translation quality of the text, but especially on the translation quality of the most informative portions of the text. In this paper we address the problems of ranking translated sentences within a document and ranking translated documents within a set of documents on the same topic according to their inf...
متن کاملReadability Assessment of Translated Texts
In this paper we investigate how readability varies between texts originally written in English and texts translated into English. For quantification, we analyze several factors that are relevant in assessing readability – shallow, lexical and morpho-syntactic features – and we employ the widely used Flesch-Kincaid formula to measure the variation of the readability level between original Engli...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Corpora
سال: 2018
ISSN: 1749-5032,1755-1676
DOI: 10.3366/cor.2018.0153